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Abstract. An extended formulation of a polytope P is a system of linear inequalities and 
equations that describe some polyhedron which can be projected onto P. Extended formulations 
of small size (i.e., number of inequalities) are of interest, as they allow to model correspond- 
ing optimization problems as linear programs of small sizes. In this paper, we describe several 
aspects and new results on the main known approach to establish lower bounds on the sizes of 
extended formulations, which is to bound from below the number of rectangles needed to cover 
the support of a slack matrix of the polytope. Our main goals are to shed some light on the ques- 
tion how this combinatorial rectangle covering bound compares to other bounds known from the 
literature, and to obtain a better idea of the power as well as of the limitations of this bound. 
In particular, we provide geometric interpretations (and a slight sharpening) of Yannakakis' 1351 
result on the relation between minimal sizes of extended formulations and the nonnegative rank 
of slack matrices, and we describe the fooling set bound on the nonnegative rank (due to Diet- 
zfelbinger et al. (7)) as the clique number of a certain graph. Among other results, we prove that 
both the cube as well as the Birkhoff polytope do not admit extended formulations with fewer 
inequalities than these polytopes have facets, and we show that every extended formulation of a 
d-dimensional neighborly polytope with fl{d^) vertices has size Q,(cf ). 



1. INTRODUCTION 

An extended formulation of a polytop43 P C M'^ is a system of linear inequalities and linear 
equations defining a polyhedron Q C along with an affine map vr : — > M'^ satisfying 
7r((5) = P. Note that by translating P and/or Q, we may safely assume that vr is linear. The size 
of the extended formulation is the number of inequalities in the system (the number of equations 
is ignored, since one can easily get rid of them). As in this setting linear optimization over P can 
be done by linear optimization over Q, one is interested in finding small (and simple) extended 
formulations. Fig. [T] illustrates an extended formulation of size 6 in of a regular 8-gon. In 
fact, every regular 2'^-gon has an extended formulation of size 2k ll3j|2Tl, which can, e.g., be 
exploited in order to approximate second-order cones. 

However, several polytopes associated with combinatorial optimization problems have sur- 
prisingly small extended formulations (for recent survey articles we refer to ||5J|33j[T9l). Among 
the nicest examples are extended formulations of size 0{tr') for the spanning tree polytopes 
of complete graphs with n nodes (due to Martin EVl ) and of size 0{n\ogn) for the permu- 
tahedron associated with the permutations of n elements (due to Goemans |[T4l ). It may not 
be very surprising that no polynomial size extended formulations of polytopes associated with 
NP-hard optimization problems like the traveling salesman polytope are known. However, the 
same holds, e.g., for matching polytopes to this day. While it may well be that we simply still 



Date: July 10, 2012. 

'For all basic concepts and results on polyhedra and polytopes we refer to 1371 . 

1 



2 



FIORINI, KAIBEL. PASHKOVICH, AND THEIS 



TT 




Figure 1 . A regular 8-gon as a projection of a cube. 

have missed the right techniques to construct polynomial size extended formulations for the lat- 
ter polytopes, most people would probably agree that for the first ones we simply still miss the 
right techniques to prove lower bounds on the sizes of extended formulations of concrete poly- 
topes. Note, however, that RothvoB ||29l recently established, by an elegant counting argument, 
the existence of 0/1 -polytopes P in such that the size of every extended formulation of P is 
exponential in d. 

There is a beautiful approach due to Yannakakis Il35l for deriving lower bounds on the sizes of 
extended formulations that share symmetries of the polytope to be described. In fact, Yannakakis 
proved that neither the traveling salesman polytopes nor the matching polytopes associated with 
complete graphs admit polynomial size extended formulations that are invariant under permuting 
the nodes of the graph. These techniques can, for instance, be extended to prove that the same 
holds for the polytopes associated with cycles or matchings of logarithmic size in complete 
graphs, which do, however, have non-symmetric extended formulations of polynomial size [22}, 
as well as to show that the permutahedra do not have symmetric extended formulations of sub- 
quadratic size ll28l . 

Asking for the smallest size xc{P) of an arbitrary extended formulation of a polytope P, 
one finds that it suffices to consider extended formulations that define bounded polyhedra Q, 
i.e., polytopes (see ifTOl . llT9l ). Indeed, if Q is an unbounded polyhedron that is mapped to 
the polytope P by the linear map vr, then the recession cone of Q is contained in the kernel 
of TT (as 'ir{Q) = P is bounded). Thus, the pointed polyhedron Q' = Q n L with L being 
the orthogonal complement of the lineality space of Q satisfies tt{Q') = P as well. If Q' is 
bounded, we are done. Otherwise, choosing some vector a that satisfies (a, x) > for all non- 
zero elements of the recession cone of Q' and some /3 G M such that /3 > {a,v) holds for all 
vertices v of Q', we have that Q" := {x e Q' : {a, x) = /?} is a polytope with niQ") = P. 

Therefore, defining an extension of a polytope P to be a polytope Q along with an affine 
projection that maps Q to P, the extension complexity xc(P) of P equals the minimal number 
of facets of all extensions of P (where we can even restrict our attention to full dimensional 
extensions). Note that since a polytope is the set of all convex combinations of its vertices, it has 
a simplex with the same number of vertices as an extension, showing that xc(P) is at most the 
minimum of the numbers of vertices and facets of P. 

In the same paper Il35l where he established the above mentioned lower bounds on symmetric 
extended formulations for matching and traveling salesman polytopes, Yannakakis essentially 
also showed that the extension complexity of a polytope P equals the nonnegative rank of a 



COMBINATORIAL BOUNDS ON NONNEGATIVE RANK AND EXTENDED FORMULATIONS 3 

slack matrix of P, where the latter is a nonnegative matrix whose rows and columns are indexed 
by the facets and vertices of P, respectively, storing in each row the slacks of the vertices in 
an inequality defining the respective facet, and the nonnegative rank of a nonnegative matrix M 
is the smallest number r such that can be written as a product of two nonnegative matrices 
with r columns and r rows, respectively. (See |[25l . or O and the references therein.) Such a 
nonnegative factorization readily induces a covering of the set supp(M) of non-zero positions 
of M by r rectangles, i.e., sets formed as the cartesian product of subsets of the row and of 
the column indices. Thus, the smallest number of rectangles that cover supp(M) (the rectangle 
covering number of M) is a lower bound on the nonnegative rank of M, and hence, on the 
extension complexity of P if M is the slack matrix of P. Rectangle covering is the same 
concept as covering the edges of a bipartite graph by bicliques (e.g., HI), and it also coincides 
with factorizing a Boolean matrix (e.g., |[6l[T6l). 

In fact, almost all techniques to find lower bounds on the extension complexity of polytopes 
are combinatorial in the sense that they yield lower bounds on the rectangle covering numbers 
of slack matrices rather than exploiting the true numbers in the matrices. The main goal of 
this paper is to present current knowledge on the concept of rectangle covering numbers in 
particular with respect to its relation to the extension complexity of polytopes and to add several 
results both on lower bounds on rectangle covering number (providing lower bounds on the 
extension complexity of certain polytopes) as well as on upper bounds (revealing limitations 
of this approach to find lower bounds on the extension complexity of polytopes). Though the 
paper is written from the geometric point of view, we hope that it may also be useful for people 
working on questions concerning the nonnegative rank from a more algebraic point of view. 

The paper starts by describing (in Section |2]) the relation between rectangle coverings and ex- 
tensions both in the pure combinatorial setting of embeddings of face lattices as well as via the 
characterization of the extension complexity as the nonnegative rank of slack matrices. Here, 
we also give geometric explanations of Yannakakis' algebraic result relating extensions and 
nonnegative factorizations. We then give some results on upper bounds on rectangle covering 
numbers. For instance, we show that the rectangle covering number of slack matrices of poly- 
topes with n vertices and at most k vertices in every facet is bounded by 0(/c^ log n) (Proposi- 
tion l3.2l ). In Section|4]we briefly describe the connection between the rectangle covering number 
of a nonnegative matrix and deterministic as well as nondeterministic communication complex- 
ity. While the relation to deterministic communication complexity theory had already been used 
by Yannanakis |[35l, recent results of Huang and Sudakov |[T8l on nondeterministic communi- 
cation complexity have interesting implications for questions concerning extension complexity 
as well. In Section |5] we then present several techniques to derive lower bounds on rectangle 
covering numbers, as well as some new results obtained by using these techniques. We start by 
proving that the logarithm of the number of faces of a polytope P is not only a lower bound 
on its extension complexity (as already observed by Goemans HH) but also on the rectangle 
number of a slack matrix of P. Observing that the rectangle covering number of a nonnegative 
matrix is the chromatic number of the rectangle graph of the matrix, we interpret the. fooling set 
bound (see Dietzfelbinger et al. tV]) as the clique number of that graph. Exploiting that bound, 
we show, for instance, that for both the cube as well as for the Birkhoff polytope the extension 
complexity equals the number of facets (Propositions 15.91 and 15.101 ). However, we also show that 
for no d-dimensional polytope the clique bound can yield a better lower bound on the extension 
complexity than {d + 1)^. We also treat (in Section I541 ) the relation between the independence 
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ratio of the rectangle graph and the concept of generalized fooling sets (due to Dietzfelbinger et 
al. 171), and we prove that for d-dimensional neighborly polytopes with Q,{cP) vertices the rec- 
tangle covering number (and thus the extension complexity) is bounded from below by 

We close this introduction with a few remarks on notation. We only distinguish between row 
and column vectors in the context of matrix multiplications, where, as usual, vectors are meant 
to be understood as column vectors. For a matrix M, we denote by Mj ,t and M^^j the vectors in 
the i-th row and in the j-th column of M, respectively. All logarithms are meant to refer to base 
two. 

2. Projections and Derived Structures 

2.1. Lattice Embeddings. Let us denote by C{P) the face lattice of a polytope P, i.e., the set 
of faces of P (including the non-proper faces and P itself) partially ordered by inclusion. The 
following proposition describes the relation between the face lattices of two polytopes one of 
which is a projection of the other. The statement certainly is not new, but as we are not aware of 
any explicit reference for it, we include a brief proof. An embedding of a partially ordered set 
{S, ^) into a partially ordered set (T, !^) is a map / such that u ^vif and only if f{u) C f{v). 
Notice that every embedding is injective. 

Proposition 2.1. If Q and P CM.'^ are two polytopes, and vr : — )• M'^ is an affine map 
with tt{Q) = P, then the map that assigns h{F) := Qr\ tt~^{F) to each face F of P defines an 
embedding of C{P) into C{Q). 

Proof. For any face F of P defined by an inequality {a, x) ^ /3, we have (a, 7r(?/)) ^ /3 for all 
y ^ Q with equality if and only if TT{y) G F holds. Thus, h{F) is indeed a face of Q, defined 
by the linear inequality {a, vr(y)) ^ /3. Moreover, h is clearly an embedding. □ 

Remark 2.2. From the definition of h, we see that h{F n G) = h{F) n h{G) holds for all faces 
F, G. We call a lattice embedding h with this property meet-faithful. Clearly, not all lattice 
embeddings ai^e meet-faithful. 

Figure |2] illustrates the embedding from Proposition 12. 1 1 In the figure, P is a 4-dimensional 
cross-polytope, Q is a 7-dimensional simplex and vr is any affine projection mapping the 8 ver- 
tices of Q to the 8 vertices of P. Denoting by the ith unit vector, we have Q = conv{0, ei , . . . , e/} C 
MJ, P = convjei, — ei, . . . , €4, —©4} C and for instance 7r(0) = (Ei, 7r(ei) = — ei, . . . , 
7r(e7) = —©4. As the figure suggests coiTcctly, constructing a small extended formulation for a 
polytope P means to hide the facets of P in the fat middle part of the face lattice of an extension 
with few facets. 

2.2. Slack Representations. Let P be a polytope in W^. Let Ax ^ 6 be a system of linear 
inequalities such that 

P = {x£R'^ : Ax s^b}, 
and let m denote the number of inequalities involved in Ax ^ b. The slack vector of a point 
2; G M'' is the vector b — Ax S M™, and the slack map of P w.r.t. Ax ^ 6 is the affine map 
fj : M'' — )• M."^ that maps each vector to its slack vector. We call the polytope P := (t{P), which 
is the image of P under the slack map a, the slack representation of P w.r.t. Ax ^ b. Clearly the 
affine hull aff (P) of P is the image of the affine hull aff (P) of P under a. We call aff (P) the 
slack space of P w.r.t. Ax ^ b. Note that the polytope P is affinely isomorphic to the polytope 
P and P = aff (P) n M!p. 
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Figure 2. Embedding of the face lattice of the 4-dimensional cross-poly tope 
into the face lattice of the 7-dimensional simplex. 

A set of r nonnegative vectors T := {ti, . . . , t,.} C M™ is called a slack generating set of P 
w.r.t. Ax ^ 6 of size r if every point in P can be expressed as a nonnegative combination of the 
vectors in T. Then the following system 

r 

^ Afctfc G aff (P) and Afc ^ for all k G [r] 

provides an extended formulation of P of size r via the projection map A i— )• Afctfe. Since 

P and P are affinely isomorphic, this also yields an extension of P of size r, which is called a 
i/ac^ extension of P w.r.t. j4x ^ h. 

Lemma 2.3. Tff/je inequality {c,x) ^ (5 w valid for a polytope P := {x G M'^ | ^ 6} 
with dim(P) ^ 1, it is a nonnegative combination of the inequalities of the system Ax ^ b 
defining P. 

Proof. By Farkas's lemma, because P 7^ 0, there exists 6' ^ 5 such that (c, x) ^ 5' is a 
nonnegative combination of the inequalities of Ax ^ b. Now let i be an index such that 
m.m{{Ai^i,,x) I 2: G P} < max{(^j^v,, x) | a; G P} = 64. Such an index exists be- 
cause dim(P) ^ 1. Letting b'- := min{(Aj^^, | a; G P}, we see that the inequality 
— {Ai,^,x) ^ —6- is valid for P. By Farkas's lemma, because of the minimaUty of b'-, this 
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last inequality is a nonnegative combination of the inequalities of Ax ^ b. By adding the in- 
equality (ylj ,t,x) ^ bi and then scaling by (6j — b'-)~^, we infer that ^ 1 is a nonnegative 
combination of the inequalities of Ax ^ b. After scaling by 6 — 5' and adding the inequality 
(c, x) ^ 6', we conclude that (c, x) ^ 5 is a nonnegative combination of the inequalities of 

Ax^b. □ 

Lemma 2.4. Lef P := {x e Mf^ \ Ax ^ b} be a poly tope with dim(P) ^ 1. If the poly tope P 
has / facets, then P has a slack generating set of size f w.r.t. Ax ^ b. 

Proof. Let m and r denote the total number and the number of nonredundant inequalities of 
Ax ^ b respectively. W.l.o.g., assume that these r nonredundant inequalities come first in the 
system Ax ^ b. By Lemma [231 each of the m — r last inequalities of Ax ^ 6 is a nonnegative 
combination of the r first inequalities of Ax ^ b. This implies that, for each i G [m] \ [r], there 
are nonnegative coefficients j G M+ such that 

r r 

= ^ Ak,Jk,i and = ^ b^tk^i ■ 

k=l k=l 

By letting j := 1 if i = k and tk,i := for i G [r], the above equations hold for all i G [m]. 
This defines a set T := {ii, . . . , tr} of nonnegative vectors tk G M™, A; G [r]. 

To show that T is a slack generating set, consider for any point x £ P the nonnegative 
combination 

r 

^{bk-{Ak,^,x))tk. (1) 

k=l 

For i G [m] , the i-th coordinate of this vector equals 

r r r 

^(6fe - (^fc,*,a;))tfc,i = '^htk,i - {^Ak,i,tk,i,x) =bi- 
fe=i fe=i fc=i 

thus ^ equals the slack vector of x. 

Moreover, in ([T|) the coefficients corresponding to inequalities that hold with equality for all 
points of P can be chosen to be zeroes. Thus the corresponding vectors can be removed from 
r, resulting in a slack generating set of size /. □ 

Theorem 2.5. The extension complexity of a polytope P with dim(P) ^ 1 is equal to the 
minimum size of a slack generating set of P w.r.t. any given system Ax ^ b defining P. 

Proof. For an extension Q C M"^ of P of size r we have to show that there is a slack generating 
set of the polytope P of size r. Since the polytopes P and P are affinely isomorphic, Q is an 
extension of the polytope P, i.e there exists an affine map vr : M*^ — such that vr(Q) = P. 
Let Cj G W^, i G [m] and G M be such that 7r{y)i = Qi — {ci,y). Note that the inequality 
{ci,y) ^ gi is vahd for Q, because ^{Q) C P C M™. 

Now, consider a system Cy ^ g of n linear inequalities describing Q such that the first m 
inequalities of this system are (ci, y) ^ gi, .. ., {cm, y) ^ gm, and a corresponding slack map 
T : — )• M". Again, Q := t{Q) is affinely isomorphic to Q. Therefore, Q is also an extension 
of P. Indeed, the projection p : M" — to the first m coordinates maps Q onto P, because 
vr = p o r (see Figure |3]l. 
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Q ^ Q 
P ^ P 

Figure 3. The four polytopes P, P, Q and Q and the maps between them. 

By Lemma 12.41 there is a slack generating set {ti, . . . , t^} ^ 1^+ for Q of size r. We 
claim that stripping the last n — m coordinates of these vectors result in a slack generating 
set {p{ti), . . . , p{tr)} of the polytope P. 

Indeed, consider a point x £ P and its slack vector a{x) G P. Because P C tt{Q) there is a 
point y £ Q such that 7r(y) = a{x). Because {ti, . . . , tr} is a slack generating set of Q, there 
exist nonnegative coefficients Ai, . . . , G M+ such that r(y) = Afeifc. From linearity of 

p and vr = p o T, we have 

(r \ ^ 

A;=l / fc=l 

and thus P has a slack generating set {p{ti), . . . , p{tr)} of size r. □ 

Note that in Theorem |23] one may take the minimum over the slack generating sets w.r.t. any 
fixed system of inequalities describing P. In particular, all these minima coincide. 

2.3. Non-Negative Factorizations. Again, let P = {x ^M.'^ : Ax ^ 6} be a polytope, where 
A G M""^^, b G R"\ Let y = {wi, . . . , v„} C denote any finite set such that 

P = conv(y) . 

The slack matrix of P w.r.t. Ax and V is the matrix S = (Sij) G R'^^"' with 

Si,j = bi- {Ai^^, Vj) for all i G [m], j G [n] . 

Note that the slack representation P C of P (w.r.t. Ax ^ 6) is the convex hull of the 
columns of S. 

If the columns of a nonnegative matrix T G M™^'' form a slack generating set of P, then there 
is a nonnegative matrix U G M^^" with 5 = TU. Conversely, for every factorization S = TU 
of the slack matrix into nonnegative matrices T G M^^*^ and U G ]R!j_^", the columns of T form 
a slack generating set of P. 

Therefore, due to Theorem 12. 5 [ constructing an extended formulation of size r for P amounts 
to finding a rank-r nonnegative factorization of the slack matrix S, that is a factorization S = 
TU into nonnegative matrices T with r columns and U with r rows. In particular, the following 
result follows, which is essentially due to Yannakakis ||35l (he proved that extension complexity 
and nonnegative rank are within a factor of two of each other, when the size of an extension is 
defined as the sum of the number of variables and number of constraints defining the extension). 

Theorem 2.6 (see Yannakakis II35II ). The extension complexity of a polytope P (which is neither 
empty nor a single point) is equal to the nonnegative rank of any of its slack matrices. In 
particular, all the slack matrices of P have the same nonnegative rank. 
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Remark 2.1. It is obvious from the definition that the extension complexity is monotone on 
extensions: if polytope Q is an extension of polytope P, then xc((5) ^ xc(P). From Theorem 
12.61 we see immediately that the extension complexity is also monotone on faces: if polytope P 
has F as a face, then xc(P) ^ xc(F) because we can obtain a slack matrix of F from a slack 
matrix of P by deleting all columns that correspond to points of P which are not in F. 

A nonnegative matrix T is the first factor in a nonnegative factorization of a slack matrix 
of P if and only if the columns of T form a slack generating set of P. In order to characterize 
the second factors of such nonnegative factorizations, let us consider an extension Q of P with 
projection vr and some set V = {vi, . . . , f„} with P = couv{V). A section is a map s : V ^ Q 
such that tt{s{x)) = x holds for all x ^ V. Clearly, every extension posseses a section map. 

Recall that (R")* denotes the dual vector space of M", which we here regard as the set of 
all row vectors of size n. For an inequality (c, y) ^ g that is valid for Q, we construct a 
nonnegative row vector in (M")!^ (the nonnegative orthant in (E")*), whose j-th coordinate 
equals g — (c, s{vj)) and call it the section slack covector associated with (c, y) ^ g w.r.t. set 
V. A set U of nonnegative row vectors in (IR")!!j_ is a complete set of section slack covectors 
if there is some extended formulation for P along with some section such that U is precisely 
the set of section slack covectors associated with the inequalities in the extended formulation. A 
nonnegative matrix U is the second factor in a nonnegative factorization of a slack matrix of P if 
and only if the rows of U form a complete set of section slack covectors (both w.r.t. to the same 
set V with P = couv{V)). From this, one in particular derives (again using Farkas's Lemma) 
the following characterization, where the slack covector associated with some valid inequality 
(a, x) ^ 6 for P = conv(y) is the nonnegative row vector in (M")^ whose j-th coordinate is 
h- {a,Vj). 

Proposition 2.8. A set U CI (M");^ is a complete set of section slack covectors for P = coiw{V) 
(with 1 ^ \V\ < oo) if and only if every slack covector associated with a valid inequality for P 
can be expressed as a nonnegative combination of the elements ofU. 

2.4. Rectangle Coverings. According to Theorem 12.61 finding lower bounds on the extension 
complexity of a polytope amounts to finding lower bounds on the nonnegative rank of its slack 
matrices. Not surprisingly, determining the nonnegative rank of a matrix is a hard problem from 
the algorithmic point of view. Indeed, it is NP-hard to decide whether the nonnegative rank of a 
matrix equals its usual rank B4il (where, of course, the first is never smaller than the second). 

One way to bound the nonnegative rank rank+(M) from below is to observe that a nonnega- 
tive factorization M = TU, where T has r columns and U has r rows, yields a representation 

r 

k=l 

of M as a sum of r nonnegative rank-1 matrices ^[7^ ,t. Denoting by supp(-) the support of 
a vector or of a matrix (i.e., the subset of indices where the argument has a nonzero entry), we 
obviously have 

r 

supp(M) = y supp(r^,fcC/fc,^) 

k=l 

since both T and U are nonnegative. 
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A rectangle is a set of the form IxJ, where / and J are subsets of the row respectively column 
indices of M. A rectangle covering of M is a set of rectangles whose union equals supp(Af ). It 
is important to notice that all the rectangles in any rectangle covering of M are contained in the 
support of M. The rectangle covering number of M is the smallest cardinality rc(M) of any 
rectangle covering of M. Clearly, we have 

rc(M) ^ rank+(M) (2) 

for all nonnegative matrices M since 

supp(r^,fcC/fc,*) = supp(r^,fc) X supp([/fc,^) 

holds for each A; S [r] . In particular, when M = 5 is some slack matrix of the polytope P 
(neither empty nor a single point), then by Theorem 12. 61 we have 

Tc[S) ^ rank+(5') = xc(P) . 

Like rank+(5) = xc(P), the rectangle covering number rc(5) is actually independent of the 
actual choice of the slack matrix S of P, see Lemma l2. lOl below. 

Let us call the support matrix suppmat(5) of 5 the 0/1-matrix arising from S by replacing 
all nonzero-entries by ones. Clearly, we have rc(5) = rc(suppmat(5)). The rectangles that can 
be part of a rectangle covering of 5 are called 1-rectangles because any such rectangle induces 
a submatrix of suppmat(S') that contains only one-entries. 

Furthermore, any 0/1-matrix whose rows are indexed by some set Fi, . . . , Fm of faces of P 
including all facets and whose columns are indexed by some set Gi , . . . , G„ of nonempty faces 
of P including all vertices such that there is a one-entry at position {Fi,Gj) if and only if 
face Fi does not contain face Gj is called a non-incidence matrix for P. Associating with every 
inequality in Ax ^ b the face of P it defines and with every point in V the smallest face of P it 
is contained in, one finds that the set of support matrices of slack matrices of P equals the set of 
non-incidence matrices of P (up to adding/removing repeated rows or columns). 

Clearly, adding a row or a column to a nonnegative matrix does neither decrease the rectan- 
gle covering number nor the nonnegative rank. The following result on the rectangle covering 
number is easy to see. 

Lemma 2.9. The rectangle covering number remains unchanged if one adds a row or a column 
whose support is the union of the supports of some existing rows or columns, respectively. 

Now, we can prove that all non-incidence matrices, and thus all slack matrices, of a polytope 
(of dimension at least 1) have the same rectangle covering number. 

Lemma 2.10. All non-incidence matrices of a polytope (which is neither empty nor a single 
point) have the same rectangle covering number. 

Proof. Letting P denote the polytope in the statement of the lemma, consider a non-incidence 
matrix M for P. Every proper face F of P contains exactly those faces of P that are contained 
in all facets of P that contain F. For the non-incidence matrix M, this implies that the support 
of the row corresponding to F is exactly the union of the supports of the rows corresponding to 
the facets containing F. 

Similarly, every nonempty face G of P is contained in exactly those faces of P in which all 
vertices of G are contained. This implies that the support of the column corresponding to G is 
exactly the union of the supports of the rows corresponding to the vertices contained in F. 



10 



FIORINI, KAIBEL, PASHKOVICH, AND THEIS 



The result then follows from Lemnia |2!9] because the rectangle covering number of M equals 
the rectangle covering of the submatrix of M corresponding to the facets and vertices of P, and 
thus does not depend on M. (Notice that there might be a row in M for F = P, but this row is 
identically zero.) □ 

Denoting by rc(P) the rectangle covering number of any non-incidence matrix for P, we 
have 

TC{P) ^ xc(P) . (3) 
When studying the rectangle covering number, we may freely choose the non-incidence matrix 
we consider. The most natural choice is the facet vs. vertex non-incidence matrix, since it appears 
as a submatrix of every non-incidence matrix. (The rows of this 0/1-matrix are indexed by the 
facets, and the columns ai^e indexed by the vertices. The entry coiTcsponding to a facet-vertex 
pair is 1 if and only if the facet does not contain the vertex.) Nevertheless, we will sometimes 
consider non-incidence matrices with more rows. 

2.5. Rectangle Coverings, Boolean Factorizations and Lattice Embeddings. As mentioned 
before, a rank-r nonnegative factorization of a nonnegative matrix M can be regarded as a de- 
composition of M into a sum of r nonnegative rank-1 matrices. Similarly, we can regard a 
rectangle covering of a Boolean (or 0/1-) matrix M with r rectangles as a rank-r Boolean fac- 
torization, that is a factorization M = TU expressing M as the Boolean product of two Boolean 
matrices T with r columns and U with r rows. Furthermore, a rank-r Boolean factorization of a 
Boolean matrix M is equivalent to an embedding of the relation defined by M into the Boolean 
lattice 2[''l (that is, the set of all subsets of [r], partially ordered by inclusion), in the sense of the 
following lemma. 

Lemma 2.11. Let M be amxn Boolean matrix. Then M admits a rank-r Boolean factorization 
M = TU if and only if there are functions f : [m] — >• 21^1 and [n] — )■ 2^ such that Mij = 
if and only if g{j) C f{i)forall (ij) G [m] x [n]. 

Proof. This follows immediately by interpreting the ith row of the left factor T as the incidence 
vector of the complement of the set f{i) and the jth column of the right factor U as the incidence 
vector of the set (7 (j). □ 

From Lemma l2.1 1[ we can easily conclude a first lattice-combinatorial characterization of the 
rectangle covering number rc(P) of a polytope P by taking M to be a non-incidence matrix of 
P. 

Theorem 2.12. Let P be a polytope with dim(P) ^ 1. Then rc(P) is the smallest r ^ 1 such 
that C{P) embeds into 2^. 

For a given poset, the smallest number r such that P embeds into the Boolean poset 2['] is 
known as the 2-dimension of the poset (see e.g., ifTTIl and the references therein). We note that, 
independently, Gouveia et al. |[T5l have found a similar connection. 

Proof of the Theorem \2.12\ Let Fi, . . . , Fm denote the facets of P , \et vi, . . . , Vn denote the 
vertices of P and let M denote the facet vs. vertex non-incidence matrix for P. 

Suppose first that rc(P) ^ r, that is, M has a rank-r Boolean factorization. From maps / 
and g as in Lemma |2. 11 [ we define a map h from C{P) to 2[^1 by letting h{F) := Hi FCF /(*)• 
It is clear that F (1 G imphes h{F) C h{G). Now assume h{F) C h{G). Pick a vertex Vj of F. 
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We have Mij = and thus g{j) C f{i) for all facets Fi containing F. Hence, g{j) C h{F). 
Because h{F) C h{G), we have g{j) C f{i) and thus Mij = for all facets Fi containing G. 
Since Vj is arbitrary, this means that every facet containing G contains all vertices of F, which 
implies F <^ G. Therefore, h is an embedding. 

Next, if h is an embedding of £{P) into 2^, we can simply define f{i) := h{Fi) and g{j) := 
h{{vj}). These maps satisfy the conditions of Lemma |2. 11 [ hence M has a rank-r Boolean 
factorization and rc(P) ^ r. The theorem follows. □ 

We immediately obtain a second lattice-combinatorial characterization of the rectangle cov- 
ering number of a polytope. 

Corollary 2.13. The rectangle covering number rc(P) of a polytope P with dim(P) ^ 1 is 
equal to the minimum number of facets of a polytope Q into whose face lattice C{Q) the face 
lattice C{P) of P can be embedded. 

Corollary 12. 13 [ follows from Theorem |TT2] and Lemma [2. 14l below. 

Lemma 2.14. IfQ is a polytope with r ^ 2 facets then there is an embedding of C{Q) into 2^. 

Proof. We denote by Gi, . . . , Gr the facets of Q, and define a map h from C{Q) to 2^ by 
letting h{G) := {/c G [r] | G ^ Gk} for all faces G of Q. It is easily verified that h is an 
embedding. □ 

The embedding C{P) — )■ C{Q) given by Corollary 12.131 is not always meet-faithful. It is 
unclear whether requiring that the embedding be meet-faithful would give a (much) better bound. 

Remark 2.15. Similarly to the extension complexity, the rectangle covering number is easily 
shown to be monotone on extensions and on faces. If the polytope Q is an extension of the 
polytope P, then we may infer, e.g., from Proposition 12. II and Corollary 12. 131 that tc{Q) ^ 
rc(P). Moreover, by reasoning on the slack matrices directly, we see that rc(P) ^ rc(F) 
whenever polytope P has F as a face. 

3. Upper Bounds on the Rectangle Covering Number 

In this section, we discuss some examples of polytopes for which small rectangle coverings 
can be found, as well as methods for constructing such rectangle coverings. These examples 
illustrate cases where the rectangle covering bound shows its limitations. Cases where the rec- 
tangle covering bound can be successfully applied to obtain interesting lower bounds on the 
extension complexity are discussed in the Section [S] 

3.1. The Perfect Matching Polytope. The perfect matching polytope Pmatch(?^) is defined as 
the convex hull of characteristic vectors of perfect matchings in the complete graph K'^ = 
{Vn, En). A linear description of the perfect matching polytope Pmatch(f^) is as follows |[9l : 

Pmatch(n) = {x G M^" : x{5{v)) = 1 for all v G 

x{5{S)) ^ lforall5CK,3^ |S| |5|odd 
Xe ^ for all e G P„ }, 

where 5{S) denotes all edges in the graph with exactly one endpoint in S, 5{v) := 5{{v}), 
and x{F) := J^e^F^e. for all edge sets F C P„. Currently, no polynomial-size extension 
is known for the perfect matching polytope Pmatch(^)- Moreover, it was shown that under 
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certain symmetry requirements no polynomial-size extension exists |[35ll22l . On the other side 
it is possible that a non-symmetric polynomial-size extension for the perfect matching polytope 
can be found |[22ll . In this context any non-trivial statement about the extension complexity of 
Pmatch("-) is interesting. 

The non-incidence matrix M (w.r.t. the above system of constraints and the characteristic 
vectors of perfect matchings) of the perfect matching polytope has Q{v?) rows corresponding to 
the non-negative constraints and 0(2") rows corresponding to the inequalities indexed by odd 
subsets of vertices of K". A facet corresponding to some odd set S and a vertex corresponding 
to some matching are non-incident if and only if the matching has more than one edge in 6{S). 

Lemma 3.1. The rectangle covering number rc(Pmatch("-)) is O(n^). 

Proof. The non-zero entries in rows corresponding to the non-negative constraints can be triv- 
ially covered by O(n^) 1 -rectangles of height one. Less obvious is the fact that the non-zero 
entries corresponding to the odd-subset inequalities can be covered by O(n^) 1 -rectangles |[35| . 
For this one considers the rectangles i?ei,e2 = -^ei,e2 ^ Jei,e2 indexed by unordered pair of edges 
ei, 62, where the set Iei,e2 consists of all odd sets S such that ei, 62 G 6{S) and the set Jei,e2 
consists of all the matchings containing both edges 61,62- □ 

Thus in the case of Pmatch('^) we cannot obtain any lower bound better than O(n^) solely by 
reasoning on rectangle coverings. 

3.2. Polytopes with Few Vertices on Every Facet. We consider a polytope P with n vertices 
such that every facet of P contains at most k vertices, and its facet vs. vertex non-incidence 
matrix M. Each row of M has at most k zeros, and at least n — k ones. We now prove that the 
rectangle covering number of such a polytope is necessarily small. 

Proposition 3.2. If P is a polytope with n vertices and at most k vertices on each facet, then 
rc(P) = ©(/c^logn). 

Before giving the proof, we state the following lemma that is the main ingredient of the proof. 

Lemma 3.3. Let M be the 0/1 matrix with rows indexed by {S [n] : \S\ ^ A;} and columns 
indexed by {T C [n] : \T\ ^ i}, where i ^ k ^ n, such that the entry Ms^t is non-zero if and 
only if the sets S, T are disjoint. Then rc(M) = 0{{k + £)6^{1 + k/iY log n). 

Proof. Consider rectangles ly x Jy, V C [n], where ly ■= {<§ C [n] : 5 C y, jS"! ^ k} and 
Jv ■= {T <^[n] : TnV = 0, |r| ^ £}. Obviously, every rectangle ly x Jy, V C [n] is 
contained in supp(M). 

Pick a set y C [n] by selecting the points from [n] independently with probability p, for some 
p G [0, 1]. For a fixed pair {S, T) of disjoint sets 5, T C [n] with |5| /c and |r| ^ the 
probability to be covered is at least 

q:=p^{l-pY, 

thus choosing the probability p equal to -^-n to maximize the value q we get 
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Now, let us bound the natural logarithm of the expected number of entries from supp(M) 
which are not covered if we choose independently r such rectangles. An upper bound on this 
quantity is: 



ln((n + l)'=(n+l)''(l-g)'') ^ (k + i)ln{n + 1) + rln{l - q) 

^{k + £) ln(n + 1) - rq 

^ {k + £) ln(n + 1) - re~ 



k + - 



If the above upper bound for the logarithm of the expected number of not covered entries from 
supp(M) is negative, we can conclude that there exists a rectangle cover for the matrix M of 
size r. Thus there exists a rectangle cover of size 0{{k + £)e^{l + k/^Y log n). □ 

Proof of Proposition \3.2\ Let M be the facet vs. vertex non-incidence matrix of P. It suffices to 
show that rc(Af) = 0(/c^ logn). We may extend M by adding extra rows in order to obtain a 
matrix in which each binary vector of size n with at most k zeros appears as a row. Obviously, 
this operation does not decrease rc(M). The result then follows from Lemma [331 by taking 
£:=l. □ 



The most natural case where Proposition 13.21 applies is perhaps when P is a simplicial d- 
dimensional polytope with n vertices. For such a polytope there is a rectangle covering of the 
non-incidence matrix with 0{d^ logn) rectangles. 

3.3. The Edge Polytope. The edge polytope Pedgc(G') of a graph G is defined as the convex 
hull of the incidence vectors in M^^*^) of all edges of G. Thus Pcdgc(G) is a 0/1-polytope with 
vertices in M^^"^). Consider a stable set S of G. Denoting by N{S) the neighborhood 
of S in G, we see that the inequality 

x{S) - x{N{S)) ^ (4) 

is valid for Pcdgc(G). It can be shown EOl that these inequalities, together with ^ for 
V G V{G) and x{V{G)) = 2, form a complete linear description of Pcdgc(G). 

3. 3. (a). A Sub-quadatric Size Extension for All Graphs. The following lemma provides an up- 
per bound not just on the rectangle covering number of Pcdgc(G), but also on the extension 
complexity of Pedge(G). 

Lemma 3.4. For every graph G with n vertices there exists an extension of the edge polytope 
PedgclG) of size 0{r?l log n). 

Proof. Let H denote a biclique (i.e., complete bipartite graph) with bipartition W , U. The 
edge polytope P{H) of H has a linear description of size 0{\V{H)\). Namely, in addition 
to the equalities x{W) = 1 and x{U) = 1, the linear description consists of the nonnegativity 
constraints x^, ^ for u G V{H). From any covering of the edges of G by bicliques Hi, . . . ,Ht 
(with Hi (1 G for all i), we can construct an extension of Pedge(G) of the size 0{t + |y(i?i)| + 
. . . + \V{Ht)\) using disjunctive programming Q. Since t ^ \V{Hi)\ + . . . + \V{Ht)\ and 
since there is such a covering with + . . . + = 0(n^/logn) |[32l the lemma 

follows. □ 
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3.3.(b)- Dense Graphs. For two matrices A and B of the same size, denote hy A . B the entry- 
wise (or Hadamard) product: {A . B)ij = AijBij. The following statements are easy to verify. 

Lemma 3.5. 

(a) For any nonnegative real matrices we have rank_|„(A . B) ^ rank_|_(A) rank+(i?). 

(b) For any real matrices we have ic{A . B) ^ rc(yl) ic{B). 

Let G be a graph with n vertices. We denote S{G) the set of all independent sets of G. The 
slack matrix of Pcdgc(G) has two type of rows: those which correspond to inequalities of the 
form (|4| for S stable in G, and those which correspond to nonnegativity inequalities ^ for 
V G V{G). Let M be the support of the submatrix of the slack matrix of Pcdge{G) induced by 

the rows con-esponding to stable sets S; let M^^^ be the S{G) x ii;(G)-matrix with M^l = 1 if 

e n S = and otherwise; and let M^^) be the S{G) x £'(G)-matrix with M^l^ = 1 if u or i; 
has a neighbor in S. Then we have 

M = m(^).m(2). (5) 
Lemma 3.6. For the matrix M^^-* defined above we have rc(M^^-') ^ n. 

Proof. For each v € V{G), define a 1 -rectangle ly x Jy. The set /„ consists of all stable sets 
that contain a neighbor of v, and the set J^, consists of all edges incident to v. These n rectangles 
define a rectangle covering ofM(2). □ 

Now, we can prove an upper bound on rc(P(G)). 

Proposition 3.7. Denoting by a{G) the stability number of G, the support of the slack matrix 
of the edge poly tope of G admits a rectangle cover of size 

0{a{Gfn\ogn). 

Proof Lemma 1331 for I := 2 and k := a{G) implies rc(M(i)) = 0{a{Gf logn). The result 
follows by combining Lemmas l3.5l and l3.6l □ 

Combining Turan's Theorem |[3TI . which states that |i?(G)| ^ , with Proposition 13.71 

we obtain the final result of this section: there exists a nontrivial class of graphs for which 
the rectangle covering number cannot prove a strong Q.{\E{G)\) lower bound on the extension 
complexity. 

Corollary 3.8. The support of the slack matrix of the edge polytope of G admits a rectangle 
cover of size o{\E{G)\) if 

«(G)=o((^)'"). ^ 

4. Rectangle Covering Number and Communication Complexity 

The aim of communication complexity is to quantify the amount of communication necessary 
to evaluate a function whose input is distributed among several players. Since its introduction 
by A. Yao in 1979 iMl . it became a part of complexity theory. It was successfully applied, e.g., 
in the contexts of VLSI design, circuit lower bounds and lower bounds on data-structures [241. 

Two players Alice and Bob are asked to evaluate a Boolean function / : ^ x i? — ^ {0, 1} at 
a given input pair (a, h) ^ Ax B, where A and B are finite sets. Alice receives the input a ^ A 
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and Bob receives the input b £ B. Both players know the function / to be evaluated, but none 
of the players initially has any information about the input of the other player. They have to 
cooperate in order to compute /(a, b). Both players can perform any kind of computation. It is 
only the amount of communication between them which is limited. In a deterministic protocol, 
Alice and Bob will exchange bits until one of them is able to correctly compute /(o, 6). The 
deterministic communication complexity of the function / is the minimum worst-case number 
of bits that Alice and Bob have to exchange in order to evaluate / at any given input pair (a, b). 

The function / can be encoded via its communication matrix M = M (/) that has Mj^- = 
/(oj, bj), where Oj is the i-th element of A and bj is the j-th element of B. It is known ll35l that 
a deterministic protocol of complexity k for / yields a decomposition of M as a sum of at most 
2^ rank-one 0/1-matrices, implying rank+(Af) ^ 2^. In order to apply this for constructing 
extensions, via Theorem 12.61 it is necessary for the slack matrix to be binary. 

Let G be a graph. The stable set poly tope Pstab(G) is the 0/1-polytope in R^^*^) whose ver- 
tices are the characteristic vectors of stable sets of G. Because the maximum stable set problem 
is NP-hard, it is unlikely that a complete description of Pstab(G) will be found for all graphs G. 
However, some interesting classes of valid inequalities for PstablC) ai^e known. For instance, 
for every clique K of G, the clique inequality ^^^x ^ valid for Pstab(G')- By collecting 
the slack covectors of each of these inequalities w.r.t. the vertices of PstablC), we obtain a 0/1- 
matrix M = M{G). This matrix is the communication matrix of the so-called clique vs. stable 
set problem (also known as the clique vs. independent set problem). Yannakakis ll35l found a 
0(log^ n) complexity deterministic protocol for this communication problem. When the clique 
inequalities together with the non-negativity inequalities form a complete linear description of 
Pstab(G), that is, when G is perfect ||4J, one obtains an extension of size 2'^^'°^ ") = 7^0{iogn) 
for Pstab(<J)^ where n denotes the number of vertices of G. For general graphs, M is not a slack 
matrix of Pstab(G) but a proper submatrix. Hence rank+(M) only gives a lower bound on the 
extension complexity of Pstab(G'). 

Consider again a Boolean function / : Ax B — ^ {0,1} and a deterministic protocol for 
computing /. Fix an input pair (a, b). Given the transcript of the protocol on input pair (a, b), 
each the players can verify independently of the other player if the part of the transcript that 
he/she is responsible for is correct. The transcript is globally correct if and only if it is correct 
for both of the players. Thus one way to persuade each player that, say, f{a,b) = 1 is to 
give them the transcript for (a, b). If the complexity of the given deterministic protocol is k, 
then the players can decide whether /(a, 6) = 1 or not based on a binary vector of size k (the 
transcript). This is an example of a nondeterministic protocol. We give a formal definition in the 
next paragraph. 

In a nondeterministic protocol, both players are given, in addition to their own input a and 
b, a proof that is in the form of a binary vector vr of size k (the same for both players). The 
players cannot communicate with each other. Each of them checks the proof vr, independently 
of the other, and outputs one bit, denoted by VA{a, vr) or Vb(6, vr). The result of the protocol is 
the AND of these two bits, that is, V{a, 6, vr) := VA{a, vr) A Vb(6, vr). The aim of the protocol 
is to prove that /(a, b) = 1. In order to be correct, the protocol should be sound in the sense 
that if f{a,b) = then y(a,6, vr) = for all proofs vr, and complete in the sense that if 
f{a,b) = 1 then V(a, 6, vr) = 1 for some proof vr. The nondeterministic complexity of f is 
the minimum proof-length A; in a nondeterministic protocol for /. Letting M = M{f) denote 
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the communication matrix of /, it is well-known that the nondeterministic complexity of / is 
riog(rc(M))], see e.g. El. 

While Yannakakis proved that the deterministic communication complexity of the clique vs. 
stable set problem is O(log^n), the exact deterministic and nondeterministic communication 
complexities of the clique vs. stable set problem are unknown. The best result so far is a 
lower bound of |logn — 0(1) on the nondeterministic complexity obtained by Huang and 
Sudakov ifTSl . They also made the following graph-theoretical conjecture that, if true, would 
improve this bound to Q{log'^ n). 

Recall that the biclique partition number of a graph G is the minimum number of bicliques 
(that is, complete bipartite graphs) needed to partition the edge set of G. Recall also that the 
chromatic number x{G) of G is the minimum number of parts in a partition of the vertex set of 
G into stable sets. 

Conjecture 4.1 (Huang and Sudakov HH). For each integer k > 0, there exists a graph G with 
biclique partition number k and chromatic number at least IP ^, for some constant c > 0. 

This conjecture would settle the communication complexity of the clique vs. stable set prob- 
lem. In terms of extensions, the result of Huang and Sudakov implies a ^(n^l^) lower bound 
on the worst-case rectangle covering number of stable set polytopes of graphs with n vertices. 
Moreover, if true, the Huang-Sudakov conjecture would imply a n^(^°s") lower bound. 

5. Lower Bounds on the Rectangle Covering Number 

In this section, we give lower bounds on the rectangle covering number, and apply them to 
prove results about the extension complexity of polytopes. Our general strategy is to focus on 
a specific submatrix of the slack matrix and then use simple structural properties of the support 
of the submatrix. Although much of the underlying geometry is lost in the process, we can 
still obtain interesting bounds, which are in some cases tight. Also, we compare the bounds to 
each other whenever possible and useful. We remark that most of the bounds discussed here 
are known bounds on the nondeterministic communication complexity of Boolean functions. 
Nevertheless, they were never studied in the context of polyhedral combinatorics. 

5.1. Dimension and Face Counting. 

Lemma 5.1. Suppose the 0/1-matrix M has h distinct rows. Then rc(M) ^ log h. 

Proof. Since deleting rows does not increase the rectangle covering number, we may assume 
that M has exactly h rows, which are all distinct. 

We proceed by contradiction, i.e., we assume that there is a collection TZ of rectangles that 
covers M and contains less than log h rectangles. Each such rectangle is of the form I x J, 
where / is a set of row indices and J is a set of column indices such that Mij = 1 whenever 
i € / and j G J. 

For every row-index i ^ [h], let TZi denote the set of all rectangles I x J from TZ such that 
i £ I. Since the number of rectangles in TZ is less than log h, there are two distinct row-indices i 
and i' such that TZi and TZii are equal. Because the rectangles in TZ form a covering of the matrix 
M, it follows that the i-th row and the i'-th row of M have the same set of one-entries and are 
thus equal, a contradiction. □ 
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Obviously, the same statement holds with "rows" replaced by "colunms". It is well-known |[T4ll 
that the binary logarithm of the number of faces of a polytope P is a lower bound for the num- 
ber of facets in an extension of P. This number is in fact also a lower bound on the rectangle 
covering number of P. Furthermore, the following chain of inequalities holds. 

Proposition 5.2. For a full-dimensional polytope P in with f faces, we have 

(i + 1 ^ log / rc(P). 

Proof. The first inequality follows by induction on d. For the second inequality, consider a slack 
matrix for P with / rows with the i-th row being the slack covector of any inequality defining 
the i-th face of P. The support matrix Af of this slack matrix has / distinct rows. The conclusion 
now follows from Lemma [5?T] □ 

5.2. The Rectangle Graph. Consider a real matrix M. The rectangle graph of M, denoted 
by G{M), has the pairs such that A/j j / as vertices. Two vertices and (A;,£) of 
G{M) are adjacent if no rectangle contained in supp(M) can be used to cover both (i, j) and 
{k,l). This last condition is equivalent to asking Afj ^ = or j = 0. The following result 
will allow us to interpret most lower bounds on rc(M) in terms of graphs. We omit the proof 
because it is straightforward. 

Lemma 5.3. For every real matrix M, we have ic{M) = x{G{M)). □ 

5.3. Clique Number and Fooling Sets. The clique number uj{G{M)) of the rectangle graph 
of a real matrix M is a lower bound to its chromatic number and hence, by Lemma 15.31 a 
lower bound on the rectangle covering number of M. A clique in G{M) corresponds to what is 
known as a. fooling set: A selection of non-zeros in the matrix such that no two of them induce 
a rectangle contained in supp(A/). As is customary, we denote the clique number of a graph G 
by ul}{G). Analogously, we denote the clique number of G{M) by uo{M). We can also define 
the clique number uj{P) of a polytope P as uj{M), for any non-incidence matrix M of P. The 
next proposition shows that uj(P) is well-defined. 

Proposition 5.4. The clique number uj{M) is independent of the non-incidence matrix M cho- 
sen for the polytope P. 

Proof. Consider any non-incidence matrix M for P and a fooling set of size q := uj{M). It 
suffices to show that the facet-vertex non-incidence matrix of P has a fooling set of size q as 
well. By reordering rows and columns of M if necessary, we may assume that the fooling set is 
(Fi, Gi), . . . , (F^, Gg). Thus, Fi, ...,Fg and Gi,...,Gg are faces of P satisfying: 2 Gi 
for all i G [q], and Fi 5 Gj or Fj ^ Gi for all i, j G [q] with i / j. We assume that our fooling 
set is chosen to maximize the number of Fj's which are facets, plus the number of Gj's which 
are vertices. 

If every Fi is a facet and every Gj is a vertex, we are done because then our fooling set is 
contained in the facet vs. vertex submatrix of M. Otherwise, w.l.o.g., there exists an index i 
such that Fi is not a facet. Let i' be such that Fj/ is a facet with Fj/ 5 Fj but at the same time 
Fj/ 2 Gi. If i' > q then we can replace {Fi,Gi) by (Fj/, Gi) in the fooling set, contradicting 
the choice of the fooling set. Otherwise, i' ^ q and the fooling set contains a pair of the form 
{Fit,Gi'). Since Fj/ ^ Gj/, we have Fj ^ Gi'. Moreover, Fj/ 2 Gi by choice of Fj/. This 
contradicts the fact that (Fj, Gj), . . . , (F^, Gg) is a fooling set. □ 
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In this section, by constructing large fooling sets, we can give lower bounds on the extension 
complexity for a number of examples, including cubes (in Section |5.3.(b)| ) and the Birkhoff 



polytope (in Section [5.3.(c)| ). But before we do that, we will take a look at the Umitations of 



the clique number as a lower bound for the rectangle covering number. We show that the clique 
number of the rectangle graph is always 0{d?) for d-dimensional polytopes, and 0{d) for simple 
poly topes. 

5. 3. (a). Limitations of the Clique Number as a Lower Bound on the Rectangle Covering Num- 
ber. Here we give some upper bounds on the sizes of fooling sets. In some cases, these bounds 
immediately render useless the fooling set approach to obtain a desired lower bound for the rec- 
tangle covering number or the extension complexity of a polytope. We start with an easy upper 
bound based on the number of zeros per row. 

Lemma 5.5. If every row of M contains at most s zeros, then oj[M) ^ 2s + 1. 

Proof. If M has a fooling set of size q, then the submatrix of M induced by the rows and columns 
of the entries belonging to the fooling set contains at least q{q — l)/2 zeros. In particular, one 
row of the submatrix has at least {q — l)/2 zeros. By hypothesis, {q — l)/2 ^ s, that is, 
g ^ 2s + 1. □ 

If every vertex of a polytope is contained in at most s facets, or if every facet of a polytope 
contains at most s vertices, then ijj{P) = 0{s). In particular, for simple or simplicial polytopes, 
the fooling set lower bound is within a constant factor of the dimension. 

We now give dimensional upper bounds on a;(M). Dietzfelbinger et al. Q show the follow- 
ing. 

Lemma 5.6 (Dietzfelbinger et al. Q). For every field K and for every 0/1 -matrix M, there is 
no fooling set larger than rank]K(M)^. 

Although the rank of the non-incidence matrix M of a polytope can be substantially larger 
than its dimension (this is the case already for polygons), we can nevertheless prove the follow- 
ing by techniques similar to those of f/l . 

Lemma 5.7. For every polytope P of dimension d, we have uj{P) ^ (d + 1)^. 

Proof. Without loss of generality, assume that P is full-dimensional. Let M be a non-incidence 
matrix for P = {x G M'^ : Ax ^ 6} = conv({t;i, . . . ,Vn}), and q := (^{P). By reordering 
if necessary, we may assume that (1,1),..., {q, q) are the vertices of a maximum clique of the 
rectangle graph of M. Let Zi := {bi, -Ai^^) G i = 1, . . . ,q, and tj := (1, Vj) G M''+\ 

j = I, . . . , q. These vectors have the following properties: 

{zi,tj)^0 foralH,j = l,...,g; 

{zi,ti)>0 for alH = 1, . . . ,0^; (6) 
{zi,tj) = or {zj,ti) = for all i, j = 1, . . . , q with i ^ j ■ 

Now consider the following 2q rank-one matrices: ZitJ for i = I, . . . ,q and tjzj for j = 
I, . . . ,q. Taking the usual inner product for matrices (^4, B) = J2ij ^ij^ij = Tr{A'^B), if 
i j,v/e have 

{zitf,tjzj) = Ti{tizJtjzJ) = Tr{zjtizftj) = {zjti){zftj) = {zj,ti){zi,tj) = 0. 
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but 

{zitj,tizf) = {zi,ti){zi,ti) > 0. 
This implies that the matrices zitj, . . . , Zqt^ are linearly independent. Since we have q linearly 
independent (d + 1) x (d + l)-matrices, we conclude that q ^ {d+ 1)^. □ 

Remark 5.8. From a construction due to Dietzfelbinger et al. Q, we can infer the existence of a 

log 4 

d-dimensional poly tope P with uj{P) = VL{d^°<^^). It is an open question which of these bounds 
can be improved. 

We conclude this subsection with an example where the fooling set or clique number bound 
is particularly bad. Consider the vertex-facet non-incidence-matrix of a convex polygon with n 
vertices. This is a 77, x n 0/1 -matrix M with A/j = if and only if i is equal to j or j + 1 
modulo n. We have a;(M) ^ 5, by Lemma [531 whereas rc(M) ^ log 77, by Proposition [ 



5.3.(b). The Cube. We now apply the fooling set technique to show that d-cubes are "min- 
imal extensions" of themselves. A combinatorial d-cube is a polytope whose face lattice is 
isomorphic to that of the unit cube [0, l]'^. Compare the result of this proposition to the lower 
bound one can obtain as the binary logarithm of the number of faces, Proposition 15.21 which is 
(ilog(3) « 1.585 d. 

Proposition 5.9. If P is a combinatorial d-cube then xc{P) = 2d. 

Proof. We obviously have a; (-P) ^ xc(P) ^ 2d. Below, we prove that equality holds throughout 
by constructing a fooling set of size 2d, implying uj{P) ^ 2d. Since uj{P) only depends on 
the combinatorial type of P, we may assume that P = [0, l]'^. To define our fooling set, 
we carefully select for each facet-defining inequality of P a con^esponding vertex. The vertex 
con^esponding to the inequality rr^ ^ is the vertex Vi defined as follows: 



1 if 1 ^ i ^ i 
if i<ji^d 



For the inequality Xi ^ 1, take the vertex Wi defined as follows: 

if 1 ^ i ^ i 



^"^'^^ \ I if i<j^d 

One can check that none of the facets of the cube is incident with its corresponding vertex, but 
that for any two facets at least one of the corresponding vertices is incident to one of them. □ 

5.3.(c). The Birkhoff Polytope. The ?ith Birkhoff polytope is the set of doubly stochastic n x n 
matrices or, equivalently, the convex hull of all 77 x 77 permutation matrices. Let P denote the 
77th Birkhoff polytope. For 77 = 1, P is a point (and xc(P) is not defined). For 77 = 2, P is 
a segment and therefore xc(P) = uj{P) = 2. For 77 = 3, P has 9 facets and 6 vertices, thus 
xc(P) ^ 6. As is easily verified, the facet vs. vertex non-incidence matrix for P has a fooling 
set of size 6. Hence, xc(P) = io{P) = 6. 

Proposition 5.10. For 77 ^ 4, the extension complexity of the Birkhoff polytope is ii?. 

Proof. The Birkhoff polytope P is defined by the nonnegativity inequalities Xjj ^ 0, 7,j E 
[r7]o = {0, . . . , 77— 1} (starting indices from zero will be a bit more convenient for our treatment), 
and the equations Xj j = 1, j G [770], = 1> ^ S ["-o]- This trivially implies that 
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xc(P) ^ n^. To give a lower bound on the extension complexity, we construct a fooling set of 
size in the facet vs. vertex non-incidence matrix of P. 

For this, for every inequaUty Xij ^ 0, we define one vertex Vij such that {vij)ij > by 
giving a permutation vr of [n]o. Namely, let 7r{i) := j and 7r(i + 1) := j + 1 (all indices are to 
be understood modulo n). Moreover, we take n^k) := i + j + I — k whenever k ^ {i,i + 1}. 
This defines a permutation, and thus a vertex Vij of P. 

Now, we show that this family of inequality-vertex pairs is a fooling set. By contradiction, let 
us assume that for two different inequalities Xij ^ and Xi'jt ^ we have both {vij)iij' = 1 
and {viiji)ij = 1. Letting vr and vr' denote the permutations of [n]o associated to Vij and Viiji 
respectively, this means that 7r(i') = / and 7r'(i) = j. Because 7^ (^'li')' we conclude 
that i + j + 1 — i' = j' or (i', j') = (i + 1, j + 1), and at the same time i'+j' + l — i = j 
or = {i' + + 1). Because 2/0 and 3 7^ modulo n, these conditions lead to a 
contradiction. □ 

The Birkhoff polytope can also be described as the convex hull of the characteristic vectors of 
all perfect matchings of a complete bipartite graph with 7i vertices on each side of the bipaitition. 
The nth bipartite matching polytope is the convex hull of all (not necessarily perfect) matchings 
of a complete bipartite graph with n vertices on each side. Let P denote the nth bipartite 
matching polytope. For 7i = 1, P is a segment and xc(P) = uj{P) = 2. For n = 2, P 
has 8 facets and 7 vertices, thus xc(P) ^ 7. We leave to the reader to check that ui{P) ^ 7, 
thus xc(P) = io{P) = 7. For n = 3, P has 15 facets and 34 vertices, thus xc(P) ^ 15. It can 
be checked that P has a fooling set of size 15, hence xc(P) = uo{P) = 15. (To the inequality 
Xij ^ we associate the matching {i + 1, j + 1)} if i ^ j and {k,£), {£,k)} 

if i = j and k, i denote the two other elements of {0, 1, 2}. To the inequality J2j ^ij ^ 1 we 
associate the matching {{i + l,i + 1), {i + 2, i)}. To the inequality J2i ^ij ^ 1 we associate 
the matching {{j + 1, j + 1), + 2)}.) As before, all computations are done modulo n = 3.) 
For n ^ 4, an argument similar to the one in the proof of Proposition 15.101 shows the following. 

Proposition 5.11. For n ^ 4, the bipartite matching polytope has extension complexity v? + 2n. 

Proof (sketch). Again, the given number is a trivial upper bound. To construct a fooling set of 
this size, for the nonnegativity inequalities, we take the perfect matchings constructed in the 
proof of the previous proposition. In addition, for the inequalities Xij ^ 1, we take the 
vertex Wj with = 1 for all k / j; for the inequalities ^ 1, we take the vertex 

Ui with fc+i = 1 for all k ^ i. □ 

5.4. Independence Ratio, Rectangle Sizes, and Generalized Fooling Sets. Denote by a{G) 

\G\ 

the maximum cardinality of an independent vertex set in G. The number is a lower bound 
on the chromatic number (|G| stands for the number of vertices in G.) Moreover, taking induced 
subgraphs may improve the bound: The number 

i{G) := max , 
ucv(G) a(G\U\) 

where G[U] denotes the subgraph of G induced by the vertices in U is sometimes called the 
independence ratio of G, and is also a lower bound on the chromatic number of G. 

In the context of the rectangle covering number, these bounds are known under different 
names. The following lemmas will make that clear. We leave the easy proofs to the reader. 
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Lemma 5.12. For a real matrix M, the maximum number of entries in a rectangle contained in 
supp(M) equals a{G{M)). □ 

The concept of generalized fooling sets has been proposed by Dietzfelbinger et al. iVj as 
a lower bound on the nondeterministic communication complexity of a Boolean function. In 
the graph-coloring terminology, a k-fooling set is an induced subgraph H of G{M) for which 
a{H) ^ k holds. If a /c-fooling set on s entries of supp(M) can be found, then, clearly, s/k 
is a lower bound on the rectangle covering number. Their generalized fooling set lower bound 
on the rectangle covering number is then just the supremum of all these fractions s/k, and it 
coincides with the independence ratio. 

Whereas the fooling set / clique lower bound can be arbitrarily bad (see the example at the 
end of Section |5.3.(a)| ), this is not the case for the independence ratio. The following is true 



for general graphs (Lovasz |^6] Thm. 7]). It has been rediscovered by Dietzfelbinger et al. Q 
for the rectangle graphs of 0/1-matrices. Lovasz' argument (specialized from the more general 
setting of hypergraph coverings he in fact considers) proceeds by analyzing the following greedy 
heuristic for coloring G: pick a maximum stable set in G, then a maximum stable set S2 in 
G — Si, and so on. Denoting by the maximum number of nodes in G containing no stable set 
of size larger than k and by ti the number of stable sets of exactly size i produced by the greedy 
heuristic, one finds that Yli=i ^ ^fe holds for all /c = 1, . . . , a = a{G). Adding up these 
inequalities scaled by l/k{k + 1) for A; < a and by 1/a for A; = a one obtains the upper bound 
Ym=i i(iTTy^« + = 0[i{G) log(|G|)) on the number Yld=i of stable sets generated by 
the greedy procedure. 

Lemma 5.13. For all graphs G, we have x{G) = 0(i(G) log(|G|)). □ 



5.4.(a). The Cube Revisited. We now give an alternative proof of Proposition 15.91 based on the 
independence number of G{P) instead of the clique number. 

2nd Proof of Proposition The maximal rectangles contained in supp(M) of the facet vs. 
vertex non-incidence matrix M are among those of the form I x J, where / is a collection of 
facets of P that does not contain a pair of opposite facets and J = J(/) is the set of those 
vertices of P that belong to none of the facets in /. If q denotes the cardinality of /, the size 
of such a rectangle is precisely q ■ 2'^^'^. Indeed, q of the coordinates of the vertices of J are 
determined by J, while the other d — q coordinates are free, which implies \J\ = 2'^~'^. 

As is easily verified, q2'^~'^ is maximum for q G {1,2}. Hence, the maximum size of a 
rectangle in supp(M) is thus 2'^"^. It follows that 2d rectangles are necessary to cover all the 
ones in M, because there are 2d ■ 2^~^ ones in M. □ 

5.4.(b). Polytopes with Few Vertices on Every Facet. As we have seen in Lemma [53] above. the 
chances of obtaining a good lower bound based on fooling sets are poor if the matrix has few 
zeros in each row. A similar result is true for generalized fooling sets. 

Lemma 5.14. Let M be a real matrix with at most s zeros per row and let U be a k-fooling set 
in M. Then \ U\/k, namely, the lower bound on rc(M) given by U, is at most 2k'^s. 

Proof. Let [/ be a set of vertices of G{M) which is a A:-fooling set, i.e. the subgraph H of G{M) 
induced by U has a{H) ^ k. By Turan's theorem, we have 
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Each edge of H links two entries of M in distinct rows and columns such that at least one 
of the two other entries in the 2x2 rectangle spanned by these entries is zero. We say that 
this (these) zero(s) are responsible for the edge. Thus for each edge of H there is at least one 
responsible zero (and at most two responsible zeros), located in the rectangle spanned by U. 
Since every row or column contains at most k elements of U, each zero in the rectangle spanned 
by U is responsible for at most k"^ edges of H. Hence, the number of edges in H is at most 
k"^ times the number of zeros in the rectangle spanned by U. Thus, since U covers at most \U\ 
rows, which together contain at most \ U\s zeros, we have 

\E{H)\^k'\U\s, 

and we conclude that 

\TT\ „ 
^ 2k^s. 



k 

□ 

5.5. Fractional Chromatic Number. Another well-studied lower bound for the chromatic num- 
ber of a graph is the fractional chromatic number. This number can be defined in several equiva- 
lent ways. The most convenient is probably the following: x* (G) is the solution of the following 
linear program: 

min ^ xs (7a) 
3 

Xs ^ I for every vertex v of G (7b) 

S: v£S 

Xs ^ for every stable set S of G, (7c) 

where the sums extend over all stable sets S of the graph G. In the context of nondeterministic 
communication complexity, this bound is well-known, see for example Il23l or |[24l and the 
references therein. In particular, it is known that x{M) = 0(x*(G(M) log |G(M)|)). The 
following known fact from graph coloring (see e.g. |[30l p. 1096]) improves this slightly. 

Lemma 5.15. For all graphs G, it is true that ^ (1 + In a{G))x* (G). □ 

Clearly, if integrality conditions are imposed on the variables in the linear program (|7]) defin- 
ing x*{G), then the optimal value is the chromatic number x{G)- 

5.6. Neighborly d-Polytopes with Q{cf) Vertices. Suppose d is even and let k := d/2. Con- 
sider a neighborly polytope P in with vertex set 1/ := vert(P). Let n := \V\ denote the 
number of vertices of P. We assume that n = Q{d'^) = 0(A;^), that is, k and d are both Q{y/n). 

Consider a non-incidence matrix that has one row for each facet, as well as one row for each 
{k — l)-face, and one row per vertex. Let M denote the submatrix induced on the rows that 
correspond to the {k — l)-faces. Thus the rows of M are indexed by the A;-subsets of V (that is, 
the vertex sets of {k — l)-faces of P), and the columns of M are indexed by the elements of V 
(that is, the vertices of P). The entry of M for fc-set F (1 V and vertex fGFislifu^F 
and otherwise. It follows from Lemma 12.101 and the monotonicity of the rectangle covering 
number on submatrices that xc(P) ^ ic{M). 

Proposition 5.16. Let n,k be positive integers and M denote the (^) x n matrix defined above. 
Then Tc{M) = n (mm {n, (k+i){k+2) ^y 
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Proof. The maximal rectangles of M are of the form I x J, where J is a subset of V of size at 
most n — k and / = I (J) is the collection of all A;-sets F such that F C V — J. We define a 
cover as a collection J' = {Ji, Jt} of subsets of V, each of size at most n — k, such that the 
corresponding rectangles cover M. This last condition can be restated as follows: for each pair 
{F,v) with V ^ F there exists an index £ G [t] with v ^ Ji and F V — Ji. 

Consider a cover J' = {Ji, . . . , Jt}. For a vertex v £ V, consider the collection J^{v) of sets 
in J' that contain v. There are two cases. First, it could be that {v} G J^{v). In this case, by 
removing v from the other sets in J{v), we may assume that J{v) = {{f }}. That is, the other 
sets of J do not contain v. Otherwise, all sets in J{v) contain at least one element distinct from 
V. In this case, it should not be possible to find a fc-set F contained mV — {v} that meets all sets 
in J{v), because otherwise J would not cover the pair (F, v). In particular, there are at least 
k + 1 sets in J{v). 

Let X denote the set of the vertices v such that {v} G J, and let s := \X\. Pick distinct 
vertices vi, V2, . . . , Vk+i in V — X. This can be assumed to be possible because otherwise 

V — X has at most k vertices. If y — X is empty, then clearly the cover is trivial and t ^ n. 
Else V — X contains an element v and, because all sets in J^{v) are contained inV — X, one 
can easily find a fc-set F C y — {v} such that {F, v) is not covered by J'. 

For all £ G [A; + 1], it holds that the number of sets in J' that contain ve and none of the 
vertices vi, .. ., V£^i is at least k + 2 — i. Otherwise, we could find k + 1 — £ elements in 

V — {vi, . . . , Ve} that, added to {vi, . . . , ve-i}, would define a /c-set F C V — {v^} such that 
{F, Vi) is not covered by J. 

Now, the number of sets in J' is at least 

N , N (k + l)(k + 2) (k + l)(k + 2) 

s + {k + l) + k + {k-l) + ... + l = s+ ^ ' ^ ^ ■ 

The first term counts the number of singletons in J', the second the number of sets containing 
vi , the third the number of sets containing V2 but not vi , and so on. □ 

When n = Q{k'^), it follows from Proposition 15. 16l that the minimum number of rectangles 
needed to cover M is 0(n). Therefore, in this case the minimum number of facets in an exten- 
sion of P is 0(n). Note that the binary logarithm of the total number of faces is in this case 
0(dlogn) = Q{dlogd). In conclusion, neighborly d-polytopes with vertices give a fam- 

ily of polytopes such that the rectangle covering number is super-hnear in both the "dimension" 
and the "binary logarithm of the number of faces" bounds. 

6. Concluding Remarks 

We conclude this paper with some open problems. We begin by reiterating the first open 
problem in Yannakakis's paper |[35l . Bounding the rectangle covering number seems to be 
the currently best available approach to bound the extension complexity of specific polytopes. 
(The new bound on the nonnegative rank proposed by Gillis and Glineur |[T3l does not lead 
to improvements for slack matrices.) Can one find other bounds? In particular, for a given 
polytope P, consider the smallest number of facets of a polytope Q into whose face lattice the 
face-lattice of P can be embedded meet-faithfully. Does this number improve substantially on 
the rectangle covering bound? Or is it always bounded by a polynomial in the rectangle covering 
bound for P? 
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For some polytopes the rectangle covering number even yields optimal or near-optimal lower 
bounds on the extension complexity as we have demonstrated in this paper for cubes, Birkhoff 
polytopes, and neighborly d-polytopes with Q{(P) vertices. However, for other polytopes such 
as the perfect matching polytope, the rectangle covering bound seems rather useless. For in- 
stance, for sufficiently irregular n-gons in M^, the extension complexity is bounded from below 
by Q{y/n), while the rectangle covering number is 0(log n) lfT2l . 

In all the examples we have found so far, the rectangle covering number is always polynomial 
in d and log n, where d denotes the dimension and n the number of vertices. Can one find 
polytopes for which the rectangle covering number is super-polynomial in d and log n? 

As mentioned above, it is true that uj{M) ^ (rank A + 1)^ for every matrix A whose support 
is M, and Dietzfelbinger et al. Q constructed a family of matrices M with rank(M)^°s ^/ ^ ^ 
Lo{M). Which of the two bounds can be improved? 

In view of our result on the extension complexity of neighborly d-polytopes with n vertices, 
it is natural to ask whether the bound Q{d'^) can be improved for d = o{y/n). 

Finally, in the absence of better lower bounds on the extension-complexity, it would be inter- 
esting to know the rectangle covering bound of the perfect matching polytope. As mentioned 
above, it is O(n^). Even a small improvement on the trivial lower bound Q.{n?') would be inter- 
esting. 

7. Final Note 

After submitting this paper, our third open problem was solved by Fiorini, Massar, Pokutta, 
Tiwary and de Wolf ifTTIl . They prove among other things that the rectangle covering number of 
the cut polytope is super-polynomial in the dimension and logarithm of the number of vertices 
of the polytope. 
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